Introducing MultiScale technique with CACM-RL

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

14:38] awvanderhoek: CACM

Any software development project uses software tools that assist its developers in coordinating their efforts. These tools, termed coordination technologies in this paper, have undergone remarkable changes over time in the functionality that they offer. We contribute a novel perspective on this historical trend with our Coordination Pyramid, a framework that recognizes four distinct paradigms o...

متن کامل

Extending XML-RL with Update

With the extensive use of XML in applications over the Web, how to update XML data is becoming an important issue because the role of XML has expanded beyond traditional applications, in which XML is used as a mean for data representation and exchange on the Web. This paper presents a novel declarative XML update language which is an extension of the XML-RL query language. Compared with other e...

متن کامل

Off-Environment RL with Rare Events

Policy gradient methods have been widely applied in reinforcement learning. For reasons of safety and cost, learning is often conducted using a simulator. However, learning in simulation does not traditionally utilise the opportunity to improve learning by adjusting certain environment variables – state features that are randomly determined by the environment in a physical setting but controlla...

متن کامل

XML-RL Update Language

Supporting for updating XML documents has recently attracted interest. This paper presents a novel declarative XML update language, which is an extension of the XML-RL query language. We define XML-RL update syntax and a set of primitive update operations to fully evolve XML into universal data representation and sharing format.

متن کامل

Soar-RL: integrating reinforcement learning with Soar

In this paper, we describe an architectural modification to Soar that gives a Soar agent the opportunity to learn statistical information about the past success of its actions and utilize this information when selecting an operator. This mechanism serves the same purpose as production utilities in ACT-R, but the implementation is more directly tied to the standard definition of the reinforcemen...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Advanced Robotic Systems

سال: 2017

ISSN: 1729-8814,1729-8814

DOI: 10.1177/1729881417694289